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Abstract — Image stitching is a technique which is used for 
attaining a high resolution panoramic image. In this technique, 
distinct aesthetic images that are imaged from different view and 
angles are combined together to produce a panoramic image. In 
the field of computer graphics, photographic and computer 
vision, Image stitching techniques are considered as current 
research areas. For obtaining a stitched image it becomes 
mandatory that one should have the knowledge of geometric 
relations among multiple image co-ordinate system [l].First, 
image stitching will be done based on feature key point matches. 
Final image with seam will be blended with image blending 
technique. Hence in this paper we are going to address multiple 
distinct techniques like some invariant features as Scale Invariant 
Feature Transform and Speeded up Robust Transform and 
Corner techniques as Harris Corner Detection Technique that 
are useful in sorting out the issues related with stitching of 
images. 

Keywords- SIFT; SURF; RANSAC; Harris Corner Detector; 
Panoramic Image. 

I. Introduction 

Image stitching is an artificial combination of series of 
distinct picturesque images. For stitching of an image some 
isolated transformations are required to perform through 
warping and merging operations. By merging the overlapping 
fields of warped image, we can easily achieve a tantamount 
image of same objects, which covers the visible area of scene 
completely. [2] Shortly after the photographic process was 
developed in 1839, the use of photographs was demonstrated 
on topographical mapping. In introductory years when the 
technology was not so urbanized on that time for creating the 
panorama ‘n’ number of cameras were required to capture the 
images which were positioned at poles apart locations and at 
divergent angles. As we know that for creating the a panoramic 
image, one should have the best knowledge about the 
geometrical relationships among the co-ordinate system of the 
images that are going to be stitched, but as in introductory 
period the geometric relations were not taken into consideration 
hence a good panoramic image was difficult to created. For 
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producing the panoramic image some steps are required that 
are shown as below diagrammatically. 

The below drawn figure shows the distinct steps that are to 
be taken into consideration for generation of panoramic image. 



Figure 1 . Flow Chart for Generating Panoramic Image 


As the images are acquired that are to be stitched, some 
preprocessing is required like projection of the images need to 
be made on surface that can be any mathematical surface like 
spherical and cylindrical. The combination of image 
registration and image merging helps in further processing of 
image stitching process. 

The main work of Image registration is to correlate two or 
more than two images that belong with the same scene. Here 
one image is considered as referenced image. The other images 
that are about to stitch undergoes for geometrical 
metamorphosis. Many apprehensions are investigated which 
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are responsible for wrong calibration of multiple images that 
are captured like if we are capturing the image from a dynamic 
platform or maybe there are some distortions in lens or sensors. 



Figure 2. Registration of Two MRI Images of Brain 



Figure 4. Wk is the m x n Window at location (a,b) in I k . 

By averaging the intersection of red, green and blue 
channels of k images in sequence of applied input image an 
image will be obtained given as I k . 


There are various application are available of image 
registration. The flow chart evolving panoramic image is 
shown below. 


No 



The variable k can be from I to the total no. of images in 
sequence of images. 

W k+ i (u, v)is an another window with the position (u,v). 

S k (u,v)is called as similarity measure with the position 
(u,v) and its value can be given as, 


Sfc (n, 2i=i 2y=i ik/c (tv) w^k+i(u,v) (by) I (i) 


Again the value of similarity measure is being calculated 
which is shown as S k (u*, v*) at the optimal matching position 
(u*,v*) and its value is given as, 


S k 


C u*,v *) = min 


{SkttJ)) 

l<i<H—M,l< j<L—M 


( 2 ) 


As the process of registration is being completed now we 
need to merge the images for producing a panoramic image 
hence process of merging is called image merging. 


II. Approaches of Image Stitching Algorithm 

There are various approaches used for stitching of image, 
here we are going to discuss only two of them. 


(i) Direct Approach: Direct techniques helps in reducing the 
summation of accurate differences between imbricating pixels 
of an image. In this method, each pixel value is compared with 
each other. These approaches are having conglomerate 
characteristics. In this approach, juxtaposition is made among 
all the pixel intensity values of images that are going to stitch. 
There are also some benefits of using this method given as 
follows. 


Figure 3. Flow Chart of Image Registration Technique for Generating 
Panoramic Image 

Where W k gives a mxn window that is defined on I k . 
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a) As in this method, each pixel value is compared with each 
other; hence contribution of each pixel is also measured. 

b) Direct techniques aims at using the existing data in image 
coalition optimally. 

(ii) Feature Based Approach: The compilation of image 
feature (extrema) points can be performed by measuring the 
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whole features of the existing image with the available images. 
Various distinct steps are used in feature based techniques for 
extraction of features, registration and blending. For feature 
extraction of several images we basically use number of 
techniques like Scale Invariant Feature Transform (SIFT), 
Speeded up Robust Feature (SURF) and Harris Corner 
Detection. 

As Scale Invariant Feature Transform (SIFT) technique has 
sturdiness but it is having low calculation speed hence it is not 
fine for real time applications. Speeded up Robust Feature 
(SURF) is more exceptional than SIFT and it produces better 
computational cost. For improving the computational cost an 
integral image is used. One more technique is introduced which 
is used for feature extraction is known as Harris Corner 
Detector. Harris Corner Detector is not invariant to scale 
changes and cross correlation [4]. 

III. Model of Image Stitching Technique 

In this section we will discuss about the model of image 
stitching technique. Some vital steps are to be held on for this. 

(i) Image acquisition 

(ii) Feature Detection and Matching 

(iii) Image Matching RANSAC Translation Estimation 

(iv) Global Alignment 

(v) Blending and Composition 



Figure 5. Image stitching model 


(i) Image Acquisition: Image acquisition is the primary stride 
that pacts with capturing an input image for engendering 
diverse separate panoramic images. The acquisition action can 
be performed in various ways like acquisition by camera 


rotations, acquisition by camera translation and acquisition by 
hand held camera. 


a) Acquisition by camera rotation: In this method a tripod is 
used and it is set at a particular chosen location. Its location 
remains constant throughout the whole acquisition process. 



Object of 
interest 


Figure 6. Acquisition by Camera Rotation using Tripod and Camera 


The geometry of overlapping images is as shown in figure 
given below 



Figure 7. Geometry of overlapping images 


The ratio of the width of the overlapping region to the 
width of the image can be calculated as, 


tan(a~) 

1 a 


(3) 


This can also be written as, 


tan(a-|) 

tan (f) 


(4) 


Where L = width of acquired image 

/ = the width of overlapping region between adjacent images. 
There are a number of advantages are illustrated of using the 
camera rotation for image acquisition some are listed as 
below, 

(i) In this method camera remains constant at single position 
for capturing a number of images. 

(ii) Measurement requirement is less and this can be executed 
smoothly. 

b) Acquisition by Camera Translation: In this proposed 
method, camera position does not remain same it keeps on 
moving parallel in direction of imaging plane. For this purpose 
camera is positioned onto a sliding plate. For capturing the 
image directly, the camera and sliding plates are implanted in 
front of object and it captures the images until a succession of 
image does not envelop the entire range. 
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Figure 8. Geometry for image acquisition for camera translation 


Where / = the width of overlapping region between adjacent 
images. 

L = length of acquired image, 
t = camera translation 
d = object of interest 

The ratio of the overlapping image to the whole image can be 
calculated as, 

- = 1 Ar (5) 

L 2 dtan(~) 

c) Acquisition using Hand Held Camera: Working with the 
handheld camera is very simple as just take the camera and 
start capturing the images either by staying at the same place 
only by rotating your body or moving along the directions of 
imaging plane. Bu the images captured by this method is 
difficult to stitch because of unnecessary camera rotations for 
acquiring the images. 

(ii) Feature Detection and Matching: The base of image 
stitching model lies under the two main steps as disclosure of 
feature and feature matching. It would be favorable for 
analyzing some important features of the images than going 
through the whole image. Multiple feature extraction 
techniques are used like Scale Invariant Feature Transform 
(SIFT), Speeded up Robust Feature (SURF) and Harris Corner 
Detection. 

Some leverage related with this approach is as shown below. 

a) This algorithm produces more robustness in opposition to 
scene movement. 

b) These algorithms are probably faster. 

(iii) RANSAC for Homography: Homography basically deals 
with relating the two images that are having the same planar 
surface in space. Homography is adopted for rectification of 
images, registration of images and analyzing the camera 
translation rotation and movement between two images. 
Homography is a 3 x 3 matrix M given as below, 




™12 

1*13 

M = 

n*2i 

m 22 

m 23 


.m 3 1 

m 32 

™ 3 3 . 


Homography relates the pixel co-ordinates in two images if 
x' = Mx 


Where x and x’ are the points given as x = (u, v, 1) in one 
image and x — (u\ v ',1) in another image. 


In this section, the parameter of homography is determined 
using Random Sample Consensus (RANSAC) algorithm. 
RANSAC loop involves selecting four feature pairs (at 
random); compute Homography H (exact); compute inliers, 
keep largest set of inliers, and finally it recomputed least- 
squares H estimate on all of the inliers [3]. RANSAC 
algorithm finds its goal by choosing a random subset of 
original data iteratively. RANSAC is used for adjusting the 
model existence of data outliners. Here we are going to 
discuss a fitting problem with parameter ‘x\ Following are the 
assumptions that are to taken into consideration for 
determining parameters. 

a) 6 N’ number of data items is used for determining the 
parameters. 

b) Total number of available data item is M. 

c) If in any situation algorithm fails to find out a good fit test 
then the probability if one exists is denoted as P fai i. 


The algorithm for RANSAC is given presented by using a 
flow chart. 



Figure 9. Flow Chart for RANSAC Algorithm 
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(iv) Global Alignment: in case that we are having the 
numerous images of same scene and want to combine all of 
them in an exact 3D reconstruction so for this purpose bundle 
adjustment is contemplate as the best technique. The aim of 
this step is to find a globally consistent set of alignment 
parameters that minimize the miss-registration between all 
pairs of images. Initial estimates of the 3D location of features 
in the scene must first be computed, as well as estimates of the 
camera locations [3]. Then, bundle adjustment applies an 
iterative algorithm to compute optimal values for the 3D 
reconstruction of the scene and camera positions, by 
minimizing the log-likelihood of the overall feature projection 
errors using a least-squares algorithm [6]. If we are given an 
unordered set of images to register, we need to discover which 
images go together to form one or more panoramas [5]. 

(v) Blending and Composition: The fundamental objective of 
blending as well as composition steps is to administer an 
attractive panoramic image. 

IV. Explanation Of Feature Extraction Techniques 

In field of image processing various feature extraction 
techniques are used some are given as below 

(i) Scale Invariant Feature Transform (SIFT) 

(ii) Speeded up Robust Feature (SURF) 

(iii) Harris Corner Detection 

(i) Scale Invariant Feature Transform (SIFT): This feature 
extraction technique was developed by D.G. Fowe in 2004. 
SIFT multi-scale feature relies on the Gaussian function to 
scale the image transformation into a single multi-scale image 
space, on which stable feature points can be extracted [7]. The 
below given equation is used in determining the scale space of 
an image, 

L{x,y,a) = G{x,y,a) * / (x, y) (7) 

Where, a represents the scale space factor. 

I (x,y) Represent applied input image. 

G(x,y, cr)Represent 2D Gaussian convolutional cord. 

The value of G(x,y, a) is as given, 

G(x,y,a ) = ^e-( x2+ ^ 2 ) /2ff2 (8) 

For efficiently detecting the key points in scale space, 
difference of Gaussian (DOG) of scale space is calculated as, 
D ( x , y, u) = (G(x,y,Ka) - G (x, y,a))* I (x, y) (9) 

This also can be written as, 

D(x,y,a ) = L(x,y, ka) — L{x,y, <r)(10) 



Figure 10. Local Maximum Detection in DoG Scale Space 

Scale Invariant Feature Transform can be given 
diagrammatically hence it is shown as below. 



Figure 1 1 . Flow Chart for SIFT 


In SIFT, Key points are first extracted by searching over all 
scales and image locations, then the descriptors defined on the 
key point neighborhood are computed, through to compare the 
Euclidean distance of their descriptors to extract the initial 
feature points pair, then eliminate spurious feature points pair 
by applying RANSAC, finally transform the input image with 
the correct mapping model for image fusion and complete 
image stitching [8]. 
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(ii) Speeded up Robust Feature (SURF): In the list of feature 
extraction techniques, Speeded up Robust Feature (SURF) is 
one of them. It is very famous algorithm. It was given by 
Herbert Bay et. al. in 2006. It can be used in various tasks like 
object recognition or 3D reconstruction. As SURF algorithm 
provides better result than SIFT and it is several times faster 
than SIFT. For detection of interest points, SURF uses an 
integer approximation of determinant of Hessian Blob 
Detector. Square shaped filter are used as an approximation. 


If we are using the integral image, then square shaped filters 
provides the best result. 

S(x,y) = Sf =0 Sj =0 /(i,y) (11) 

The SURF detector is based on the determinant of the 
Hessian matrix [9]. Let we have a point X = (. x,y ) in an 
image I, then Hassian matrix H(X, a ) at scale a in X can be 
calculated as, 


H(X,a) = 


^XX L X y ( X , cr) 

L xy C ^0 Lyy (X , o) 


( 12 ) 


Where, L xx (x,<r) is the convolution of the Gaussian second 

d 2 

order derivative — g(a) with image I in point X, and 

similarly for L xy (X, a ) and L yy (X, a) . Speeded up Robust 
Feature can also be shown diagrammatically in form of flow 
chart which is given as below, 



SURF algorithm has great advantages in the feature point 
extraction, main direction identification as well as feature 
vector alignment [10]. 


(iii) Harris Corner Detection: The Harris Corner Detection is a 
point feature extracting algorithm. This algorithm was 
provided by Chris Harris and MJ Stephens since 1988. The 
main reason for developing Harris Corner detector was to 
build up a local detecting corner feature in image. As this 
algorithm demands for high computation, apart from this it is 
broadly adopted. The algorithm for Harris Corner Detector is 
as given below. 

1 . Compute x and y derivaties of image 

I X = G**I 9 Iy = G**I (12) 


2. 

3. 

4. 

5. 

6. 


Compute products of derivatives at every pixel 

I X 2 — 4r4(’ Iy 2 ~ lyly > ^ xy ~ ^X'^y (13) 

Compute the sums of the products of derivatives at each 
pixel. 


$x2 ~ Gal * 4:2’ Sy2 
Define at each pixel 

H (x,y) = 


Gal ■ Iy2 > S X y . l X y 

(x, y) the matrix, 

S x2 (x,y) S xy (x, y) 

5xy (x,y) Sy 2 (x,y)\ 


(14) 

(15) 


Compute the response of detector at each pixel 

R = Det( H) - K(7Yace(H)) 2 (16) 

Threshold on value of R. compute non max suppression. 


The Harris Corner Detection technique is proposed to extract 
the corners which need not to set the threshold by manual and 
is insensitive to isolated point, noise and edge [11]. 

V. Conclusion 

In the field of computer vision image stitching is painstaking 
to be the best research area. It deals with distinct feature 
extraction algorithms. We have analyzed various algorithms 
with their merits and demerits as Scale Invariant Feature 
Transform (SIFT) is robust but it is not good for real time 
applications and having low calculation speed, Speeded up 
Robust Feature (SURF) has mediocre accuracy but it is 
slowest than any other algorithms and Harris Corner Detector 
algorithm is having the poor accuracy but it provides the good 
computational cost. Hence in future some other measures are 
to be taken into consideration for removing their demerits as 
well and also the video stitching is to be done to provide the 
dynamic panorama. 
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